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1. 


Introduction. 


\ 


The  following  graph  problem  arises  in  the  study  of  global  flow 

analysis  and  program  optimization  [2,6],  Let  G = (V,  E, r)  be  a flow 

*/ 

graph  with  start  vertex  r A vertex  v dominates  another  vertex 

w ^ V in  G if  every  path  from  r to  w contains  v . Vertex  v 
is  the  immediate  dominator  of  w , denoted  v = idom(w)  , if  v 
dominates  w and  every  other  dominator  of  w dominates  v . 

Theorem  1 [2,6].  Every  vertex  of  a flow  graph  G = (V,  E,  r)  except  r 
has  a unique  immediate  dominator.  The  edges  [ (idom(w), w)  |weV-[r]} 
form  a directed  tree  rooted  at  r , called  the  dominator  tree  of  G , 
such  that  V dominates  w if  and  only  if  v is  a proper  ancestor 
of  w in  the  dcminator  tree.  See  Figures  1 and  2. 

[Figure  1] 

[Figure  2] 


We  wish  to  construct  the  dominator  tree  of  an  arbitrary  flow  graph  G . 
Aho  and  Ullman  [2]  and  Purdom  and  Moore  [7]  describe  a straightforward 
algorithm  for  solving  this  problem.  For  each  vertex  v 5^  r , we  carry 
out  the  following  step: 

General  Step:  Determine,  by  means  of  a search  from  r , the  set  S of 

vertices  reachable  from  r by  paths  which  avoid  v , The 
vertices  in  V-[v}-S  are  exactly  those  which  v dominates. 

Knowing  the  set  of  vertices  dominated  by  each  vertex,  it  is  an  easy  matter 
to  construct  the  dominator  tree. 

*/ 

Appendix  A contains  the  graph-theoretic  terminology  used  in  this  paper. 
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To  analyze  the  running  time  of  this  algorithm,  let  us  assume  that 
G has  m edges  and  n vertices.  Each  execution  of  the  general  step 
requires  0(m)  time,  and  the  algorithm  performs  n-1  executions  of  the 
general  step;  thus  the  algorithm  requires  0(mn)  time  total. 

Aho  and  Ullman  [3]  describe  another  simple  algorithm  for  computing 
dominators.  This  algorithm  manipulates  bit  vectors  of  length  n . Each 
vertex  v has  a bit  vector  which  encodes  a superset  of  the  dominators 
of  V . The  algorithm  makes  several  passes  over  the  graph,  updating 
the  bit  vectors  during  each  pass,  \mtil  no  further  changes  to  the  bit 
vectors  occur.  The  bit  vector  for  each  vertex  v then  encodes  the 
dominators  of  v . 

This  algorithm  requires  0(m)  bit  vector  operations  per  pass  for 
0(n)  passes,  or  0(nm)  bit  vector  operations  total.  Since  each  bit 
vector  operation  requires  0(n)  time,  the  running  time  of  the  algorithm 
is  0(n  m)  . This  bound  is  pessimistic,  however;  the  constant  factor 
associated  with  the  bit  vector  operations  is  very  small,  and  on  typical 
graphs  representing  real  programs  the  number  of  passes  is  small  (on 
reducible  flow  graphs  [3]  only  two  passes  are  required  [1+]''. 

In  this  paper  we  shall  describe  a faster  algorithm  for  solving  the 
dominators  problem.  The  algorithm  uses  depth-first  search  [9]  in 
combination  with  a data  structtire  for  evaluating  functions  defined  on 
paths  in  trees  [13 ].  We  present  a simple  implementation  of  the  algorithm 
which  runs  in  0(m  log  n)  time  emd  a more  sojAiisticated  implementation 
which  runs  in  0(m  a(m,n))  time,  where  a(m,n')  is  a functional  inverse 


of  Ackermann's  function. 


The  algorithm  is  a refinement  of  earlier  versions  appearing  in 
[10,11,12],  Although  proving  its  correctness  and  verifying  its  running 
time  require  rather  complicated  analysis,  the  algorithm  is  quite  simple 
to  program  and  is  very  fast  in  practice.  We  programmed  both  versions  of 
the  algorithm  in  Algol  W,  a Stanford  Uhiversity  version  of  Algol,  and 
tested  the  programs  on  an  IBM  570/168.  We  con^jared  the  programs  with 
a transcription  into  Algol  W of  the  Purdom  - Moore  algorithm  and  with 
an  implementation  of  the  bit  vector  algorithm.  On  all  but  the  smallest 
graphs  tested  our  algorithm  beat  the  other  methods. 

The  paper  consists  of  five  sections.  Section  2 describes  the 
properties  of  depth-first  search  used  by  the  algorithm  and  proves 
several  theorems  which  imply  the  correctness  of  the  algorithm.  Some 
knowledge  of  depth- first  search  as  described  in  [9]  and  Section  2 of  [10] 
is  useful  for  understanding  this  section.  Section  5 develops  the 
algorithm,  using  as  primitives  two  procedures  that  manipxolate  trees. 
Section  4 discusses  two  implementations,  simple  and  sophisticated,  of 
these  tree  manipulation  primitives.  Scrnie  knowledge  of  Sections  1,  2, 
and  5 of  [I3]  is  useful  for  understanding  this  section.  Section  5 
presents  our  experimental  results. 


2. 


Depth-First  Search  and  Dominators. 


Suppose  we  perform  a depth-first  search  on  a flow  graph  G = (V,  E,  r) 
starting  frctn  vertex  r , and  that  we  number  the  vertices  of  G from  1 
to  n as  they  are  reached  during  the  search.  The  search  generates  a 
spanning  tree  T rooted  at  r , with  vertices  nmbered  in  preorder  [5]. 

See  Figure  3. 

[Figure  3] 

The  following  paths  lemma  is  an  important  property  of  depth-first 
search  and  is  crucial  to  the  correctness  of  the  dominators  algorithm. 

Lemma  1 [9].  If  v and  w are  vertices  of  G such  that 
number (v)  < nmberCw)  , then  any  path  from  v to  w in  G must 
contain  a common  ancestor  of  v and  w in  T . 

As  an  intermediate  step,  the  dominators  algorithm  computes  a value 
for  each  vertex  w ^ r called  its  semi-dominator,  denoted  by  sdom(w) 
and  defined  by 

(1)  sdam(w)  = minfnumber(v)  | there  is  a path  v = ■Vq,v^,  = w such 

that  number (v^)  > number(w)  for  1 < i < k-l]  . 

See  Figure  3. 

The  following  lemmas  describe  some  basic  properties  of  semi-dominators 
and  immediate  dominators. 

Lemma  2.  For  any  vertex  w ^ r , let  v be  the  vertex  such  that 


number (v)  a sdom(w)  . Then  v is  a proper  ancestor  of  w in  T . 


Proof.  Let  parent (w)  be  the  parent  of  w in  T . Since  (parent (v) , v)  is 
an  edge  of  G , by  (1)  number  (v)  = sdom(v)  < number  (parent  (v) ) < number(v)  , 
By  (l)  and  the  choice  of  v , there  is  a path  v = = w 

such  that  n'umber(v^)  > number (v)  for  1 < i < k-1  . By  Lemma  1,  some 
vertex  v^  on  the  path  is  a common  ancestor  of  v and  w . But  such 
a common  ancestor  must  satisfy  number (v^^ ) < number  (v)  . This 

means  i=0,  i.e.,  v^=v,  and  v is  a proper  ancestor  of  w . □ 

Lemma  3.  For  any  vertex  t , let  v be  the  vertex  such  that 

number (v)  = sdnm.(w)  . Then  idam('w)  is  an  ancestor  of  v in  T . 

Proof.  The  tree  path  from  r to  w contains  only  ancestors  of  w in  T . 

Thus  idom(w)  is  an  ancestor  of  w . The  path  consisting  of  the  tree 

path  from  r to  v followed  by  a path  v = v^,  v^,  * • * ^ that 

number(v. ) > number(w)  for  1 < i < k-1  (which  must  exist  by  (1)) 
avoids  all  proper  descendants  of  v which  are  also  proper  ancestors 
of  w . It  follows  that  idom(w)  is  an  ancestor  of  v . □ 

Corollary  1.  For  any  vertex  -w  ^ r , idom(w)  -*  w . 

Lemma  4.  Let  vertices  v , w satisfy  v w in  T . Then  v -*  idcm(w) 

or  idom(w)  -•  idom(v)  . 

Proof.  Let  x be  any  proper  descendant  of  idom(v)  which  is  also  a 

proper  ancestor  of  v . By  Theorem  1 and  Corollary  1,  there  is  a path 

from  r to  V which  avoids  x . By  concatenating  this  path  with  the 

tree  path  from  v to  w , we  obtain  a path  from  r to  w which  avoids  X'  . 
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Thus  idcm(w)  must  be  either  a descendant  of  v or  an  ancestor  of 


idcm(v)  . □ 

Using  Lemmas  1 - U,  we  obtain  two  results  which  provide  a way  to 
ccmipute  immediate  dominators  from  semi-dominators. 

Theorem  2.  Let  w ^ r and  let  v be  the  vertex  such  that 
number(v)  = sdam(v)  . Suppose  no  vertex  u satisfies 
number Cu)  > number Cv)  , u - w , and  sdom(u)  < sdom(w)  . Then 
idom(w)  = V . 

Proof.  By  Lemma  3,  it  suffices  to  show  that  v dominates  w . Consider 
any  path  from  r to  w . Let  x be  the  last  vertex  on  this  path 
satisfying  number (x)  < number (v)  . if  there  is  no  such  x , then  v = r 
dominates  w . Otherwise,  let  y be  the  first  vertex  following  x on 
the  path  and  satisfying  v - y w . All  vertices  z following  x on 
the  path  but  preceeding  y must  satisfy  z > y by  Lemma  1 and  the 
choice  of  x and  y . Thus  sdom(y)  < number (x)  < number(v)  = sdcBi(w) 
By  the  hypothesis  of  the  theorem,  y cannot  be  a proper  descendant  of  v . 
Thus  y = V and  v lies  on  the  path.  Since  the  path  selected  was 
arbitrary,  v dominates  w , □ 

Theorem  3 . Let  w ^ r and  let  v be  the  vertex  such  that 
number (v)  = sdom(w)  . Let  u be  a vertex  for  which  sdom(u)  is  minimum 
among  vertices  satisfying  number (u)  > number (v)  and  u -*  w . Then 
sdcm(u)  < sdom(w)  and  idom(u)  = idom(w)  . 


Proof. 


Let  X be  the  vertex  such  that  v -♦  x -•  w . Then 
sdCTii(u)  < sdoni(x)  < number  (v)  = sdomCv)  . 

By  Lemma  3^  idom(v)  is  an  ancestor  of  v and  thus  a proper 
ancestor  of  u . Thus  by  Lemma  4 idom(v)  -*  Idom(u)  . To  prove 
idam(u)  = idom(w)  , it  stuffices  to  prove  that  idom(u)  dominates  w . 

Consider  any  path  from  r to  w . Let  x be  the  last  vertex  on 
this  path  satisfying  number(x)  < number (idem (u) ) . If  there  is  no 
such  X , then  idom(u)  = r dominates  v . Otherwise,  let  y be  the 

^ -if 

first  vertex  following  x on  the  path  and  satisfying  idam(u)  -*  y -•  w . 
All  vertices  z following  x on  the  path  but  preceding  y satisfy 
number (z)  > number (y)  by  Lemma  1 and  the  choice  of  x and  y.  Thus 
sdom(y)  < number (x)  . Since  number (idam(u) ) < sdom(u)  by  Lemma  5,  we 
have  sdom(y)  < number (x)  < nTmiber(idom(u) ) < sdom(u)  . 

By  the  definition  of  u , y cannot  be  a proper  descendant  of  v . 
Furthermore  y cannot  be  both  a proper  descendant  of  idan(u)  and  an 
ancestor  of  u , for  if  this  were  the  case  the  path  consisting  of  the 
tree  path  from  r to  sdom(y)  followed  by  a path  sdcm(y)  = Vq,  v^,  . . . , Vj^ 
such  that  mmber(v^)  > number  (y)  for  1 < i < k-1  followed  by  the  tree 
path  from  y to  u would  avoid  idom(u)  ; but  no  path  from  r to  u 
avoids  Idom(u)  . 

The  only  remaining  possibility  is  that  idom(u)  = y . Thus  idcm(u) 
lies  on  the  path  from  r to  w . Since  the  path  selected  was  arbitrary, 
idcmCu)  dominates  w . □ 
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Corollary  2.  Let  v ^ r and.  let  v be  the  vertex  such  that 
number (v)  = sdcm(v)  . Let  u be  a vertex  for  which  sdan(u)  is 
minimum  among  vertices  satisfying  number (u)  > nimber(v)  and  u - w . 
Then 


(2) 


idom(w) 


r V if  sdom(w)  = sdom(u)  , 

I idom(u)  otherwise. 


Proof.  Immediate  from  Theorems  2 and  3.  Q 

The  following  theorem  provides  a way  to  ccmpute  semi-dominators. 
Theorem  4.  For  any  vertex  w ^ r , 

(3)  sdctti(w)  = min(  fnimiberCv)  | (v,w)  e E and  number (v)  < number (w)l 

U fsdam(u)  ) number (u)  > number (w)  and  there  is 

/ * 
edge  (v,w)  such  that  u -»  v in  T})  . 


Proof.  Let  t equal  the  right  side  of  (3 ) . We  shall  first  prove  that 
sdam(w)  < i . Suppose  f a number (v)  for  some  vertex  v such  that 
(v,w)  e E and  number (v)  < rnm*er(w)  . By  (1)  sdom(w)  < 1 . Suppose 
on  the  other  hand  I = sdom(u)  for  some  vertex  u such  that 
number(u)  > number (w)  and  there  is  an  edge  (v,w)  such  that  u -*  v . 

Let  X be  the  vertex  such  that  number (x)  = sdom(u)  . By  (1)  there  is 
a path  X s ^ such  that  nu^er(v^)  > number  (u)  > nuinber(w) 


V = V 

"^k-l 


for  1 < i < J-1  . The  tree  path  u = Vj  -*  " •• 

satisfies  number (v^^)  > number (w)  > n-umberCw)  for  J < i < k-1  . Thus 
the  path  x = Vq,  v^, . . . , Vj^  ^ = v , Vj^  = w satisfies  number (v^^)  > number (w) 
for  1 < i < k-1  . By  (1),  sdom(v)  < number (x)  = sdom(u)  = f . 
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It  remains  for  us  to  prove  that  sdam(w)  > i . Let  x be  the 
vertex  such  that  number (x)  = sdom(w)  , and  let  x = v^.v, ,»..,v,  = w 
be  a simple  path  such  that  number(Vj^)  > number (v)  for  1 < i < k-1  . 

If  k = 1 , (x,w)  e E , and  number(x)  < number (v)  by  Lemma  2.  Thus 

sdam(v)  = number (x)  > i . Suppose  on  the  other  hand  that  k > 1 . 

■jf 

Let  J be  minimum  such  that  j > 1 and  v.  v,  , . Such  a J exist 

— J K-X 

since  k-1  is  a candidate  for  j . 

We  claim  number(v. ) > number(v.)  for  1 < i < J-1  . Suppose  to 
the  contrary  that  number (v^)  < number (v^)  for  some  i in  the  range 
1 < i < J-1  . Choose  the  i such  that  1 < i < J-1  and  number(Vj^) 

is  minimum.  By  Lemma  1,  v.  -*  v.  , -which  contradicts  the  choice  of  j 

X J 

This  proves  the  claim. 

The  claim  implies  sdomC-w)  = number(x)  > sdom(v  n)  > i . Thus 
•whether  k = 1 or  k > 1 we  have  sdam(w)  > i ^ and  the  theorem  is 
true.  □ 
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5.  A Fast  Dcminators  Algorithm. 


In  this  section  we  develop  an  algorithm  vdiich  uses  the  resuJ.ts  in 
Section  2 to  find  dcminators.  Earlier  versions  of  the  algorithm  appear 
in  [10,11,12];  the  version  we  present  is  refined  to  the  point  where  it  is 
as  simple  to  program  as  the  straightforward  algorithm  [2,7]  or  the  bit  vector 
algorithm  [3,U],  similar  in  speed  on  small  graphs,  and  much  faster  on  large  graphs. 

The  algorithm  consists  of  the  following  four  steps. 


Step  1. 


Step  2. 


Step  3. 


Step  4. 


Carry  out  a depth-first  search  of  the  problem  graph.  Number 
the  vertices  from  1 to  n as  they  are  reached  during  the 
search.  For  each  vertex  w , determine  the  set  pred(w)  of 
vertices  v such  that  (v,  w)  is  an  edge  and  the  vertex 
parent (w)  which  is  the  parent  of  w in  the  spanning  tree 
generated  by  the  search.  Initialize  the  variables  used  in 
succeeding  steps. 

Compute  the  semi -dcminators  of  all  vertices  by  applying  Theorem  k. 
Carry  out  the  computation  vertex-by- vertex  in  decreasing  order 
by  number. 

Implicitly  define  the  immediate  dominator  of  each  vertex  by 
applying  Corollary  2. 

Explicitly  define  the  immediate  dominator  of  each  vertex,  carrying 
out  the  computation  vert ex -by- vertex  in  increasing  order  by 
number. 


Here  is  an  Algol-like  version  of  Step  1. 
stepl;  n ;b  0; 

for  ea^  v c V ^ pred(v)  :=  P;  semi(v)  = 0 od; 
DFS(r) ; 
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step  1 uses  the  recursive  procedure  DFS,  defined  below,  to  carry 
out  the  depth-first  search.  The  procedure  assumes  that  succ(v)  is 
the  set  of  vertices  w such  that  (v,w)  e E . When  a vertex  v receives 
a number  i , the  procedure  assigns  semi(v)  :=  i and  vert ex (i)  ;=  v . 

procedure  DFS (vertex  v) ; 
begin 

semi(v)  :=  n :=  n+1; 
vertex (n)  ;=  v; 

comment  initialize  variables  for  steps  2,  5,  and  li; 
for  each  w e succ(v)  d^ 

if  semi(w)  = 0 then  parent (w)  :=  v;  DFS(w)  fi; 

add  V to  pred(w)  od 

* ■ ■ 

end  DFS; 

After  carrying  out  Step  1,  the  algorithm  carries  out  Steps  2 and  5 
simultaneously,  processing  the  vertices  v ^ r in  decreasing  order  by  n\miber. 
When  processing  a vertex  v , the  algorithm  computes  sdom(v)  by  applying 
Theorem  h.  Each  edge  (u,v)  is  examined,  if  nuiiiber(u)  < number (v)  , 
number(u)  is  a candidate  for  sdom(v)  . If  number(u)  > number (v)  , the 
algorithm  finds  a vertex  x of  minimum  sdom(x)  among  vertices  satisfying 
mmber(x)  > number (v)  and  x -♦  u ; sdom(x)  is  a candidate  for 
sdcgn(v)  . The  minimum  of  all  the  candidates  is  sdom(v)  . After 
computing  sdom(v)  , the  algorithm  assigns  semi(v)  :=  sdom(v)  and  adds 
vertex  v to  the  set  bucket (u)  , where  u is  the  vertex  such  that 
number(u)  = sdom(v)  . This  completes  Step  2 for  v . Note  that  before 
sdom(v)  is  found,  semi(v)  = number (v)  , and  after  sdcm(v)  is  found, 
semi(v)  = sdcm(v)  . 
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After  the  semi-dominator  of  v is  computed,  the  algorithm  empties 
bucket (parent (v) ) . For  each  vertex  w e bucket (parent (v) ) , the  algorithm 
finds  a vertex  u of  minimum  sdom(u)  among  vertices  satisfying 
number (u)  > numb er ( parent  ( v) ) and  u -•  w . If  sdom(u)  = sdom(v)  , 
then  by  Corollary  2 the  immediate  dominator  of  w is  parent (v)  , and 
the  algorithm  assigns  dam{v)  :=  parent (v)  . If  sdom(u)  < sdom(v)  , 
then  by  Corollary  2,  u and  w have  the  same  immediate  dominator,  and 
the  algorithm  assigns  dom(w)  :=  u . The  intent  of  this  assignment 
is  to  implicitly  define  the  immediate  dcminator  of  w to  be  the 
immediate  dominator  of  a vertex  with  smaller  semi-dominator  than  w . 

This  completes  Step  5 for  vertices  w c bucket (parent (v) ) . 

•X- 

Both  Step  2 and  Step  5 require  determining,  for  certain  paths  v -•  w 

* 

in  the  spanning  tree,  a vertex  u on  the  path  v -•  w having  minimum 
sdom(u)  . To  find  such  vertices  the  algorithm  uses  a method  described 
in  [10],  The  algorithm  maintains  a data  structure  -which  represents  a 
forest  with  vertex  set  V and  edge  set  ( (parent (v),v)  | sdoim(v)  has  been 
computed}  . 
procedures : 

LINK(v,w) : 

EVAL(v) ; 


To  manipulate  this  data  structure,  the  algorithm  uses  two 

Add  edge  (v,w)  to  the  forest. 

If  V is  the  root  of  a tree  in  the  forest,  re.um  v . 
Otherwise,  let  r be  the  root  of  the  tree  in  the  forest 
which  contains  v . Return  a vertex  u r of  minimum 
sdom(u)  on  the  path  r -•  v in  the  forest. 
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Here  is  an  Algol-like  version  of  Steps  2 and  5 which  uses  LINK 


and  EVAL. 


:=  semi(x)  od; 


comment  initialize  variables; 
for  i :=  n by  -1  until  2 do 
V :=  vertex(i) ; 

step2 ; for  each  u 6 pred(v)  do 

X ;=  EVAL(u);  if  send.(x)  < semi(v)  then  senii(v) 
LINK (parent (v) , v) ; 
add  V to  bucket (vertex ( semi (v) ) ; 
step^ ; fojr  each  v e bucket  (parent  (v) ) ^ 

delete  w from  bucket (parent (v) ) ; 
u :=  EVAL(w) ; 

dom(w)  :=  if  senii(u)  < semi  (parent  (v) ) then  u 

— ■'  ^ - r -1^  - 

else  parent (v)  fi  od  od; 


Step  4 examines  vertices  in  increasing  order  by  number,  filling  in 


the  immediate  dominators  not  explicitly  computed  by  Step  3.  Here  is  an 
Algol-like  version  of  Step  4. 


step4;  for  i :=  2 ■until  n do 
V :=  vertex (i) ; 

if  dom(v)  ^ vertex ( semi (v) ) then  dom(v)  ;=  dom(dam(v  )) 

This  completes  our  presentation  of  the  algorithm  except  for  the 
in^jlementation  of  LINK  and  EVAL.  Figure  4 illustrates  how  the  algorithm 
works . 

[Figure  4] 

Appendix  B contains  a conqjlete  Algol-like  version  of  the  algorithm, 
including  "variable  declarations  and  initialization.  Using  Theorem  4 and 
Corollary  2 it  is  not  hard  to  prove  that,  after  execution  of  the  algorithm, 
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dcEi(v)  = idom(v)  for  each  vertex  v jL  r , ass-uming  that  LINK  and  EVAL 
perform  as  claimed.  The  running  time  of  the  algorithm  is  O(nM-n) 
plus  time  for  n-1  LINK  and  m+-n-l  EVAL  instructions. 
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4.  Implementation  of  LINK  and  EVAL. 

Reference  [15]  provides  two  ways  to  implement  LINK  and  EVAL>  one 
simple  and  one  sophisticated.  We  shall  not  discuss  the  details  of  these 
methods  here,  but  merely  provide  Algol-like  implementations  of  LINK  and 
EVAL  which  are  adapted  from  [15]» 

The  simple  method  uses  path  compression  to  cajrry  out  EVAL.  To 
represent  the  forest  built  by  the  LINK  instructions  (henceforth  called 
the  forest),  the  algorithm  uses  two  arrays,  ancestor  and  label. 

Initially  ancestor (v)  = 0 and  label (v)  = v for  each  vertex  v . 

In  general  ancestor(v)  = 0 only  if  v is  a tree  root  in  the  forest; 
otherwise  ancestor(v)  is  an  ancestor  of  v in  the  forest. 

The  algorithm  maintains  the  labels  so  that  they  satisfy  the  following 
property.  Let  v be  any  vertex,  let  r be  the  root  of  the  tree  in 
the  forest  containing  v , and  let  v = ^ such 

that  ancestor (Vj^)  = p for  1 i ^ • Let  x be  a vertex  such 
that  sdcan(x)  is  minimum  among  vertices  xe  [label (v^)  | 1 < i < k]  . 

Then 


(*)  X is  a vertex  such  that  sdnm(x)  is  minimum  among  vertices  x 
+ * 

satisfying  r -*  x -*  v in  the  forest. 

To  carry  out  LINK(v,  w)  , the  algorithm  assigns  ancestor (w)  :=  v . 
To  carry  out  EVAL(v)  , the  algorithm  follows  ancestor  pointers  to 
determine  the  sequence  v = • • •>''^0  “ ^ such  that 

ancestor  (v^)  = p for  l<i<k.  If  v=:r,  v is  returned. 
Otherwise,  the  algorithm  performs  a path  compression  by  assigpiing 
ancestor (v^)  :=  r for  z < i < k , updating  labels  to  maintain  (*). 

Then  label(v)  is  returned.  Here  is  an  Algol- like  procedure  for  EVAL. 
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vertex  procedure  EVAL(v); 

if  ancestor(v)  = 0 then  EVAL  :=  v 

el^e  COMPRESS (v) ; EVAL  :=  label (v)  fi; 

Recursive  procedure  CCMPRESS,  which  carries  out  the  path  compression, 
is  defined  by 

procedure  C(MPRESS(v); 

comment  this  procedure  assumes  ancestor (v)  ^ 0; 

Lf  ancestor(ancestor(v) ) ^ 0 then 
CCMPRESS(ancestor(v) ) ; 

if  semi(label(ancestor(v) ) ) < semi (label (v) ) then 
label(v)  :=  label(ancestor(v) ) fi; 
ancestor(v)  :=  ancestor(ancestor(v) ) fi; 

The  time  required  for  n-1  LINKS  and  ratn-1  EVALs  using  this 
implementation  is  0(m  log  n)  [IJ].  Thus  the  simple  version  of  the 
dominators  algorithm  requires  0(m  log  n)  time. 

The  sophisticated  method  uses  path  compression  to  carry  out  the 
EVAL  instructions  but  implements  the  LINK  instruction  so  that  path 
compression  is  carried  out  only  on  balanced  trees.  See  [13].  The 
sophisticated  method  requires  two  additional  arrays,  size  and  child. 
Initially  size(v)  = 1 and  child (v)  = 0 for  all  vertices  v . Here 
are  Algol -like  implementations  of  EVAL  and  LINK  using  the  sophisticated 
method.  These  procedures  are  adapted  from  [15]. 


! 

ff 

P, 
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vertex  procedure  EVAL(v) ; 

comment  procedure  CCMPRESS  used  here  is  identical  to  that  in  the 
simple  method; 

if  ancestor (v)  = 0 then  EVAL  ;=  label  (v) 
else  CCMPRESS  (v); 

EVAL  :=  if  semi (label (ancestor (v) ) ) > semi(label(v) ) then  label (v) 
else  label(ancestor(v) ) fi  fi; 


procedure  LINK(v,  w) ; 
begin 

comment  this  procedure  assumes  for  convenience  that 
size(O)  = label(O)  = semi(O)  = 0; 


s :=  w; 

while  semi (label (w) ) < semi (label(child(s) ) ) do 

if  size(s) + size(child(child(s) ) ) > 2*  size(child(s)  ) then 
parent ( child ( s ) ) :=  s;  child(s)  ;=  child (child(s) ) 
else  size (child (s) ) :=  size(s) ; 

s :=  parent(s)  :=  child(s)  fi  od; 
label (s)  :=  label (v) ; 
size(v)  :=  size (v )->■  size (w) ; 

if  size(v)  < 2*  size(w)  then  s,child(v)  :=  child(v),s  fi; 
while  s 0 do  parent(s)  :=  v;  s :=  child(s)  od 
end  LINK; 


With  this  implementation,  the  time  required  for  o-l  LINKS  and  m+n-1 
EVALs  is  0(m  a(m,n))  , where  a is  a functional  inverse  of  Ackermann's 
function  [1],  defined  as  follows.  For  integers  i, J > 0 , let  A(i,0)  = 0 
if  i > 0 , A(0,  j)  =2'’  if  0 > 1 , A(i,l)  a A(i-1,2)  if  i > 1 , 

and  A(i, J)  = A(i-1, A(i,  j-1) ) if  i > 1 , j > 2 , Then 
a(m,n)  = min{i  > 1 | A(i,  ) > log^  n}  . Thus  the  sophisticated 

version  of  the  dominators  algorithm  requires  0(m  a(m,n))  time. 
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5 . Implementation  axtd 


lerimental  Results 


Exp 

• We  translated  both  versions  of  the  algorithm  as  contadned  in 
Appendix  B into  Algol  W and  ran  the  programs  on  a series  of  randomly 

* 

generated  program  flow  graphs.  Table  1 and  Figures  5 and  6 illustrate 
the  results.  The  sophisticated  version  beat  the  simple  version  on  al  1 
graphs  tested.  The  relative  difference  in  speed  was  between  5 and  25^j 
increasing  with  increasing  n . 

[Table  l] 

[Figure  5] 

[Figure  6] 

We  transcribed  the  Purdom  - Moore  algorithm  into  Algol  W and  ran  it 
and  the  sophisticated  version  of  our  algorithm  on  another  series  of 
program  flow  graphs.  Table  2 and  Figure  7 show  the  results.  Our  algorithm 
was  faster  on  all  graphs  tested  except  those  with  n = 8 . The 
Purdom -Moore  algorithm  rapidly  became  non-competitive  as  n increased. 

The  trade-off  point  was  about  n = 10  . 

[Table  2] 

[Figure  7] 

We  implemented  the  bit  vector  algorithm  using  a set  of  procedxires 
for  manipulating  multi-precision  bit  vectors.  (Algol  W allows  bit  vectors 
only  of  length  32  or  less.)  Table  3 gives  the  ruming  time  of  this 
algorithm  on  the  second  series  of  test  graphs,  and  Figure  8 compares  the 
running  times  of  the  bit  vector  algorithm  and  the  sophisticated  version 
of  our  algorithm.  The  speed  of  the  bit  vector  algorithm  varied  depending 
upon  the  number  of  passes  required,  but  it  was  silways  slower  than  the 
i fast  algorithm. 

i 

j 


I 


[Table  J] 

[Figure  8] 

There  are  several  ways  in  which  the  hit  vector  algorithm  can  be  made 
more  competitive.  First,  the  bit  vector  procedures  can  be  inserted 
in-line  to  save  the  overhead  of  procedure  calls.  We  made  this  change  and 
observed  a 33  - ^5^  speed-up.  The  corresponding  change  in  the  fast 
algorithm,  inserting  LINK  and  EVAL  in-line,  produced  a 20^  speed-up. 

These  changes  made  the  bit  vector  algorithm  almost  as  fast  as  our  algorithm 
on  graphs  of  less  than  32  vertices,  but  on  larger  graphs  the  bit  vector 
algorithm  remained  substantially  slower  than  our  algorithm.  See  Table  1, 
Table  U,  and  Figure  9. 

[Table  4] 

[Figure 

Second,  the  bit  vector  procedures  can  be  written  in  assembly  language. 
To  provide  a fair  comparison  with  the  fast  algorithm  it  would  be  necessary 
to  write  LINK  and  EVAL  in  assembly  language.  We  did  not  try  this  approach, 
but  we  believe  that  the  fast  algorithm  would  still  beat  the  bit  vector 
algorithm  on  graphs  of  moderate  size. 

Third,  use  of  the  bit  vector  algorithm  can  be  restricted  to  graphs 
known  to  be  reducible.  On  a reducible  graph  only  one  pass  of  the  bit 
vector  algorithm  is  necessary,  because  the  only  purpose  served  by  the 
second  pass  is  to  prove  that  the  bit  vectors  don’t  change,  a fact 
guaranteed  by  the  reducibility  of  the  graph.  We  believe  that  a one-pass 
in-line  bit  vector  algoritVim  would  be  competitivj  with  the  fast  algorithm 
on  reducible  graphs  of  moderate  size,  but  only  if  one  ignores  the  time 
needed  to  test  reducibility. 
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The  bit  vector  algorithm  has  two  disadvantages  not  possessed  by  the 

fast  algorithm.  First,  it  requires  0(n  ) storage,  which  may  be 

prohibitive  for  large  values  of  n . Second,  the  dominator  tree,  not 

the  dominator  relation,  is  required  for  many  kinds  of  global  flow  analysis 

[8,  lJ|],  but  the  bit  vector  algorithm  computes  only  the  dominator  relation. 

Computing  the  relation  from  the  tree  is  easy,  requiring  constant  time  per 

element  of  the  relation  or  0(n)  bit  vector  operations  total.  However, 

2 

computing  the  tree  from  bit  vectors  encoding  the  relation  requires  0(n  ) 
time  in  the  worst  case. 

We  can  summarize  the  good  and  bad  points  of  the  three  algorithms  as 
follows:  the  Purdom  - Moore  algorithm  is  easy  to  explain  and  easy  to 
program  but  slow  on  all  but  small  graphs.  The  bit  vector  algorithm  is 
equally  easy  to  explain  and  program,  faster  than  the  Purdom  - Moore  algorithm, 
but  not  competitive  in  speed  with  the  fast  algorithm  unless  it  is  run  on 
small  graphs  which  are  reducible  or  almost  reducible.  The  fast  algorithm 
is  much  harder  to  prove  correct  but  almost  as  easy  to  program  as  the  other 
two  algorithms,  competitive  in  speed  on  small  graphs,  and  much  faster  on 
large  graphs.  We  favor  some  version  of  the  fast  algorithm  for  practical 
applications. 

We  conclude  with  a few  comments  on  ways  to  improve  the  efficiency  of 
the  fast  algorithm.  One  can  speed  up  the  algorithm  by  rewriting  DFS  and 
CCMPESS  as  non-recursive  procedures  which  use  explicit  stacks.  One  can 
avoid  using  an  aioxiliary  stack  for  CCMPRESS  by  instead  using  a trick  of 
reversing  ancestor  pointers;  see  [12].  A similar  trick  allows  one  to  avoid 
the  use  of  an  auxiliary  stack  for  DFS.  One  can  save  seme  additional  storage 
by  combining  certain  arrays,  such  as  parent  and  ancestor.  These  modifications 
save  running  time  and  storaige  space,  but  only  at  the  expense  of  program  clarity. 
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Appendix  A;  Graph- Theoretic  Temlnology. 

A directed  graph  G = (V, E)  consists  of  a finite  set  V of 
vertices  and  a set  E of  ordered  pairs  (v,w)  of  distinct  vertices, 
called  edges.  If  (v,w)  is  an  edge,  w is  a successor  of  v and  v 
is  a predecessor  of  w . A graph  G^  = (V^,  E^)  is  a subgraph  of  G 
if  c V and  E^  c E . A path  p of  length  k from  v to  w 
in  G is  a sequence  of  vertices  p = (v  = • • • > such  that 

(v^,v^^^)  e E for  0 < i < k . The  path  is  simple  if  v^,  .,.,Vj^  are 
distinct  (except  possibly  "'''q  = path  is  a cycle  if  "'^’q  = • 

By  convention  there  is  a path  of  no  edges  from  every  vertex  to  itself 
but  a cycle  mast  contain  at  least  two  edges.  A graph  is  acyclic  if  it 
contains  no  cycles.  If  p^  = (u  = Uq,u^, ...,u^  = v)  is  a path  from  u 
to  V and  p = (v  = v^, v^, ...,v^  = w)  is  a path  from  v to  w , the 
path  p^  followed  by  p^  is  p^  (u  = Uq,u^,  . . .,u^  = v = v^, v^, . . .,v^  = w) 

A flow  graph  G = (V, E,  r)  is  a directed  graph  (V,  E)  with  a 
distinguished  start  vertex  r such  that  for  any  vertex  veV  there  is 
a path  from  r to  v . A program  flow  graph  is  a flow  graph  such  that 

each  vertex  has  exactly  two  successors.  A (directed,  rooted)  tree 

T = (V, E, r)  is  a flow  graph  such  that  |e|  = [vj-l  . The  start  vertex 
r is  the  root  of  the  tree.  Any  tree  is  acyclic,  and  if  v is  any  vertex 
in  a tree  T , there  is  a unique  path  from  r to  v . If  v and  w 
are  vertices  in  a tree  T and  there  is  a path  from  v to  w , then  v is 

an  ancestor  of  w and  w is  a descendant  of  v (denoted  by  v -♦  w ) . If 

in  addition  v w , then  v is  a proper  ancestor  of  w and  w is  a 
proper  descendant  of  v (denoted  by  v-w).  If  v-*w  and  (v,w) 
is  an  edge  of  T (denoted  by  v -•  w ),  then  v is  the  parent  of  w 
and  w is  a child  of  v . In  a tree  each  vertex  has  a unique  parent 
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(except  the  root,  which  has  no  parent).  If  G = (V, E)  is  a graph 
and  T = (V',E',r)  is  a tree  such  that  (V',E')  is  a subgraph  of  G 
and  V = V , then  T is  a spanning  tree  of  G . 


The  Complete  Dominators  Algorithm. 


Appendix  B: 

This  appendix  contains  a complete  listing  of  both  versions  of  the 
dominators  algorithm.  The  algorithm  assumes  that  the  vertex  set  of  the 
problem  graph  is  V = [v  | 1 < v < n}  . 


proced'^e  DOMINATORS  (integer  set  array  succ(l:  :n) ; integer  r,n; 

integer  array  dom(l; ;n) ) ; 


begin 

integer  array  parent,  ancestor,  [ child, ] vertex  (l;;n); 
integer  array  label,  semi  [, size j (0::n); 
integer  set  array  pred,  bucket  (l::n); 
integer  u,  v,  x; 


procedure  DFS(integer  v) ; 
begin 

semi(v)  :=  n :=  n+1; 

vert ex (n)  :=  label (v)  :=  v; 

ancestor(v)  :=  [ child (v)  ;=]  0; 

[size(v)  :=  1;] 

for  each  wesucc(v)  do 

if  semi(w)  = 0 then  parent (w)  ;=  v;  DFS(w)  fi; 
add  V to  pred(w)  od 

end  DFS; 


if  ancestor (ancestor (v) ) ^ 0 then 
COMPRESS (ancestor (v) ) ; 

if  semi (label(ancestor(v) ) ) < semi (label(v) ) then 
label(v)  :=  label(ancestor(v) ) fi; 
ancestor(v)  :=  ancestor (ancestor(v) ) fi; 

integer  procedtire  EVAL(integer  v); 

if  ancestor (v)  = 0 then  EVAL  :=  v 

els^e  COMPRESS (v);  EVAL  :=  label (v)  fi; 
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proG^d^^^  LINK(integer  v,w); 
ajicestor(v)  :=  v; 


stepl : for  v :=  1 -until  n do 

pred(v)  :=  bucket (v)  jS;  semi(v)  :=  0 od; 
n :=  0; 

DFS(r); 

[ size(O)  :=  label(O)  :=  semi (0)  :=  0;] 
for  i :=  n by  -1  until  2 do 

V :=  vertex (i) ; 

step2;  for  each  u e pred(v)  do 

X :=  EVAL(u);  if  semi(x)  < semi(v)  then  seini(v)  :=  semi(x)  od; 
LINK(parent(v),v) ; 
add  V to  bucket (vertex (semi (v) ) ) ; 
step3 : for  each  w e bucket (parent (v) ) do 

delete  w from  bucket (parent (v) ) ; 
u :=  EVAL(w); 

dom(-w)  :=  if  semi(u)  < semi  (parent  (v) ) th^n.  u 
else  parent (v)  fi  od  od; 

stepU:  i :s=  2 n do 

V :=  vertex(i) ; 

if  dcm(v)  4 vertex(semi(v) ) then  dcm(v)  :=  dom(dom(v))  od 
end  DOIOTATORS; 


The  simple  version  of  the  algorithm  consists  of  the  procedure  above, 

•with  everything  in  brackets  deleted.  The  sophisticated  version  of  the 
algorithm  consists  of  the  proced-ure  above,  with  everything  in  brackets 
included,  and  the  following  procedures  substituted  for  EVAL  and  LINK. 

integer  proced^e  EVAL(ii^ger  v); 

if  ancestor (v)  = 0 th^  EVAL  :=  label(v) 
else  COMPRESS (v) ; 

EVAL  :=  if  semi(label(ancestor(v) ) ) > semi (label (v) ) then  label(v) 
else  label (ancestor (v) ) fi  fi; 
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procedure  LINK(int^eger  v,w) ; 
begin  integer  s; 
s :=  w; 

vjhile  semi  (label (v))  < semi  ( label ( child  ( s ) ) ) do 

if  size(s)  + size(child(child(s)))  > 2*  size(child(s) ) th^ 
ancestor(cbild(s) ) s;  child(s)  ;*=  child ( child ( s ) ) 

else  size(child(s) ) :=  slze(s) ; 

s :=  ancestor (s)  :=  child(s)  fi  od; 
label(s)  :=  label (v) ; 
size(v)  ;=  size(v)  + size('w); 

if  size(v)  < 2*  size(w)  then  s,child(v)  ;=  child ( v), s fd; 
while  s 0 do  ancestor (s)  :=  v;  s :=  child(s)  od 
end  LINK; 
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Figure  2.  Dominator  tree  of  flow  graph  in  Figure  1. 
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Figure  3.  Depth- first  search  of  flow  grajh  in  Figure  1. 

Solid  edges  are  spanning  tree  edges,  dashed  edges  are 
non-tree  edges. 

First  number  in  parentheses  is  vertex  number,  second 


is  semi-dominator. 


Figure  1;.  Forest  maintained  by  LINK  and  EVAL  during  steps  2 and  3 of  the 
doninators  algorithm.  (Trees  in  the  forest  consisting  of 
single  vertices  are  not  shown. ) 

(a)  Before  vertex  H is  processed.  Candidates  for  sdom(H) 
are  9 = number  (E)  and  1 = minfsdoni(v)  | B v * L]  . 

(b)  Before  vertex  E is  processed.  Candidates  for  sdopi(E) 
are  8 = number(B)  and  1 = minfsdom(v)  | e ^ v ^ H}  . 

After  sdom(E)  = 1 is  computed,  bucket (B)  is  unloaded. 

At  this  time  D is  the  only  element  of  bucket  (b)  . 

A is  the  vertex  such  that  sdcin(A)  = minf  sdom(v)  | B ^ A * D}  . 
Since  sdom (A)  = 1 < sdoin(D)  = 8,  dcin(D)  is  assigned 
dom(D)  :=  A . Note  that  idcci(D)  = idoiii(A)  = R . 

(c)  After  E is  processed. 
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Figure  5.  Running  times  in  10  ^ seconds  of  the  singsle  and  sophisticated 
versions  of  the  fast  algorithm. 
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Table  1.  Riinning  times  in  10  seconds  of  the  sin^^le  and  sophisticated 
versions  of  the  fast  algorithm  (three  graphs  for  each  value 
of  n ). 
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Table  2.  Riuining  times  in  10  seconds  of  the  Pur dom  - Moore  algorithm 
and  the  sophisticated  version  of  the  fast  algorithm  (three 
graphs  for  each  value  of  n ). 
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Table  3*  Running  times  in  10  seconds  and  number  of  passes 
of  the  bit  vector  algorithm  (three  graphs  for  each 
value  of  n ). 
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in-line  bit  vector 
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Table  1+.  Running  times  in  10  seconds  and  number  of  passes 
of  the  in-line  bit  vector  algorithm  (three  graphs  for 
each  value  of  n ) . 


